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ABSTRACT 

Context. While photospheric magnetic elements appear mainly as Bright Points (BPs) at the disk center and as faculae near the limb, 
high-resolution images reveal the coexistence of BPs and faculae over a range of heliocentric angles. This is not explained by a "hot 
wall" effect through vertical flux tubes, and suggests that the transition from BPs to faculae needs to be quantitatively investigated. 
Aims. To achieve this, we made the first recorded attempt to discriminate BPs and faculae, using a statistical classification approach 
based on Linear Discriminant Analysis (LDA). This paper gives a detailed description of our method, and shows its application on 
high-resolution images of active regions to retrieve a center-to-limb distribution of BPs and faculae. 

Methods. Bright "magnetic" features were detected at various disk positions by a segmentation algorithm using simultaneous G- 
band and continuum information. By using a selected sample of those features to represent BPs and faculae, suitable photometric 
parameters were identified for their discrimination. We then carried out LDA to find a unique discriminant variable, defined as the 
linear combination of the parameters that best separates the BPs and faculae samples. By choosing an adequate threshold on that 
variable, the segmented features were finally classified as BPs and faculae at all the disk positions. 

Results. We thus obtained a Center-to-Limb Variation (CLV) of the relative number of BPs and faculae, revealing the predominance 
of faculae at all disk positions except close to disk center {/i > 0.9). 

Conclusions. Although the present dataset suffers from limited statistics, our results are consistent with other observations of BPs 
and faculae at various disk positions. The retrieved CLV indicates that at high resolution, faculae are an essential constituent of active 
regions all across the solar disk. We speculate that the faculae near disk center as well as the BPs away from disk center are associated 
with inclined fields. 

Key words. Sun:photosphere - Sun:faculae, plages - Sunmagnetic fields - Methods: statistical - Techniques:high angular resolution - 
Techniques:photometric 



1. Introduction 

When imaged at high spatial resolution, the solar photosphere 
reveals a myriad of tiny bright features, primarily concentrated 
in active regions and outlining the borders of supergranules in 
the quiet Sun. Near disk cen ter, they appear mainl y as "Bright 
Point s" (BPs) or "filigree" (iDunn & Zirked 119731: Mehltretter 
1 19741) . i.e. roundish or elongated bright features located in the in- 
tergranular downflow lanes (Title et al. 1987), partic ularly bright 
when observed in Fraunhofer's G-ban d ( Mulle r & Roudierl 
Il984t iBergeret al.l[l995t iLanghans et an[2002). Near the limb, 
they resemble more side-illuminated granules called "faculae" 
or "facular grains" (e.g. Muller 1975), herein considered as in- 
dividual small-scale elements ( Hirzberger & Wiehr 2005). The 
close association of BPs and faculae with magnetic field in- 
dicators such as chromospheric Ca ll emission suggests tha t 
they are related phenomena (Mehlt retteilll974t IWilson|ll98lh. 
botb a ssociated with small-scale kG flux concentrations J stenflol 
119731) . These so-called "magnetic elements" are considered as 
the basi c building block s of the photos pheric magnetic activ- 
ity (see ISchiisslen 119921: ISolankil Il993l for reviews), whence 
the importance of understanding their fundamental physics. 



Also, much of the interest in faculae has been justified by 
their major role in pro d ucing the total sola r irradiance variation 
(Lean & Foukal 1981 iFligge et al.1 \l99^ IWalton et al.1 llool 
Krivova et al. 2003i). 

The peculiar appearance of BPs and faculae as well as their 
different distribution on the disk raises questions about their 
physical origin and mutual relationship. The standard model ac- 
counting for both these phenomena describes BPs and faculae 
as distinct radiative signatures of strongly evacuated thin flux 
tubes, arising from different viewing angles ("hot-w all" model. 
Spruit 1976: Knolker et al.lll988[ [19911 ISteinedl2005h . This sim- 
plified picture has been verified in its salient points by re cent 
comprehensive 3D MHD simulations ( Vogler et all l2005l) . A 
major success has b een the ability to qualitatively reproduce 
BPs near disk center dSchiissler et alj|2003: .Shelvag et aL.200"4l) 



and faculae closer to the limb (Keller et al. 



l2004ICarlsson et al.l 
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l2004l) . thereby confirming the basic hot-wall model to first or- 
der. However, images with the highest spatial resolution reveal 
the presence of BPs away from the dis k center, and of facular el- 
eme nts even close to the disk center (Hirzberger & Wiehrll2005l: 
Ber ger et al.ll2007l) . Such mixtures of BPs and faculae at sev- 
eral heliocentric positions is not explained by the hot-wall pic- 
ture considering vertical flux t ubes, and seems no t apparent in 
the simulated synthetic images dKeller et al.ll2004 . Further, it is 
not clear either whether the BPs and faculae seen at different 
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heliocentric angles are associated with similar magnetic struc- 
tures, or rather with different structures prone to selection ef- 
fects Cites et al. 2004; Solanki et al. 2006). This shows that the 
transition from BPs to faculae is not clearly understood, and cur- 
rent models aiming at reproducing BPs and faculae would bene- 
fit from a quantitative study of the distribution of these features 
on the disk. 

To tackle these issues, a necessary step is to sort the BPs and 
faculae observed at various disk positions, in order to treat them 
separately. The approach proposed here is the first attempt in 
this direction, and relies on Linear Discriminant Analysis (LDA) 
(lFischedll936 ) as a basis to "classify" features as BPs or facu- 
lae. Our method makes use of purely photometric information, 
so that it only distinguishes the features appearing as BPs or 
as faculae. We applied this method to high-resolution images 
of active regions, covering a range of heliocentric angles where 
the transition from BPs to faculae is expected. This allowed us 
to retrieve, for the first time, an estimate of the center-to-limb 
variation of the relative amount of both features, and thereby to 
quantitatively grasp how the appearance of magnetic elements 
varies from center to limb. 

Although Discriminant Analysis h as been fruitfully used i n 
Astronomy (see the general review bv lHeck & Murtaghlll989h . 
its application in the framework of solar physics thus far has 
been restricted to the study of the conditions triggering solar 
flares (the aim of "probabi l istic flare forecasti ng" Smith et ak, 
Il996t iLeka&BarnesI 120031: iBarnes et al.ll2007h . and to the re- 
spons e at the Earth's surface to the solar cycle dTung & CampI 
120081) . Therefore, this paper is intended to give a detailed de- 
scription of our classification method, and by the same token 
provides a concrete example of linear discriminant analysis ap- 
plied to solar data. Among other potential applications in solar 
physics, we mention the taxonomy of flares and the separation of 
chromospheric B Ps and cos mic ray spikes on wavelet-analyzed 
images ( Antoine et al ] l2002h . 

The structure of this paper reflects the path taken to re- 
solve the classification problem. Section |2] describes the origi- 
nal dataset processing, and the automated segmentation method 
by which bright features were detected at each disk position. 
Section [3] outlines the classification scheme while briefly pre- 
senting the principles of LDA. It also gives a detailed report 
of how this technique can be applied to a selected sample of 
BPs and faculae in order to derive a single discriminant vari- 
able, based on which a simple classification rule can be built. 
Section m then deals with the actual classification of all the seg- 
mented features, as well as the discussion of these results from a 
methodological and physical point of view. Finally, Sect.|5]sum- 
marizes the obtained results and gives future directions for such 
work. 



2. Image processing and segmentation 

2.1. Dataset processing 

The original dataset consists of simultaneous G-band (430.5 
+ 0.5 nm) and nearby continuum (436.3 + 0.5 nm) images 
recorded at the Im Swedish Solar Telescope (SST, La Palma), 
on 7th and 8th September 2004. They cover active regions at 
seven disk positions in the range 0.56 < (p) < 0.97, where p = 
COS0, is the heliocentric angle and (p) corresponds to the center 
of the respective field of view (FOV), equivalent to the mean p 
over the whole FOV (cf. Table [T]). This range of disk positions 
contains both BPs and faculae, and is thus well-suited to investi- 
gate their transition. Because our study requires the highest spa- 



tial resolution in order to resolve individual BPs and faculae, the 
dataset was restricted to the one to three best image pairs at each 
disk position (obtained at peaks of seeing), which were kept for 
further processing and analysis (see Table[T]). 

For the selected image pairs, phase-diversity reconstruction 
allowed a roughly constant angular resolution to be achieved, 
close to the diffraction limit (~ 0"1 at 430 nm). The recon- 
structed simultaneous image pairs (G-band and continuum) were 
aligned and destretched using cross-correlation and grid warp- 
ing techniques (courtesy P. Siitterlin). The direction of the clos- 
est limb was found by comparison with roughly co-temporal 
SOHO/MDI full disk continuum images, and the images were 
divided by the limb darkening /i-polynomial of Neckel & Labsl 
(1 19941) at the nearest tabulated wavelength (427.9 nm). For 
each image pair, the contrast C was then defined relative to 
the mean intensity (/)qs of a quasi-quiet Sun subfield (of area 
ranging from 44 to 114 arcsec^, depending on the image) as 
C - (I - {I}qs)/ (/)qs- The G-band and continuum contrast are 
hereafter denoted Co and Cc, respectively. To enhance the seg- 
mentation process (see Sect. 12.2b . we applied a high-pass spatial 
frequency filter to remove medium and large-scale fluctuations 
of the intensity (with observed spatial scales between 5 and 30"), 
presumably attributable to p-modes, supergranular cell contrasts, 
straylight and residual flat-field effects. The Fourier filter was of 
the form f(k) = 1 - e""'''^, where k is the modulus of the spa- 
tial frequency, and the parameter a was set to have a cut-off fre- 
quency {F(k) - 0.5) of 0.2 arcsec~' and full power (F(k) - 1 ) 
at 0.65 arcsec ' (in accordance with'Hirzb erger & Wiehj|2005h . 
Finally, sunspots and large pores featuring umbral dots were 
masked out, together with their immediate surrounding granules. 
This prevents the contamination of BPs/faculae statistics by fea- 
tures of a different physical nature. Figures [1] and [2] show exam- 
ples of G-band images at (p) = 0.97 and (p) - 0.6, respectively, 
in which the quiet Sun contrast reference and the masked out 
sunspot and pore areas are outlined. 

2.2. Magnetic brigiitening segmentation 

Prior to their classification as BPs or faculae, bright magnetic 
features at the different disk positions of our dataset were de- 
tected by a segmentation algorithm. The aims of our algorithm 
were twofold: 

1. Detect magnetic brightenings photometrically by compari- 
son of their contrast in G-band and continuum. 

2. Decompose groups of BPs and striated faculae into in- 
dividual elemen ts by using Multi-Level- Tracking (MLT, 
iBovelet & Wiehdl200lL 120031120071) . 

The second point significantly increases the statistics of the 
study, and relies on the assumption that these elements corre- 
spond to distinct magnetic feature s. This h as been justified for 
intergranular BPs at disk center dBerger & Title 2001), while 
observations of the dynamic behavior of striated f aculae sug - 
gest a correspondence with those BPs (.De Pontieu et al.ll2006l) . 
the dark striat ions being associated with sites of lower magnetic 
field strength (Berger et al] l2007l:ICarlsson et al.ll2004l) . 

The principle behind point 1 is best illustrated by Cq vs. Cc 
scatterplots of a plage area, as shown in Fig.[3](see Figs.[T]and|2] 
for the location of the chosen plage subfields). As can be seen in 
Fig. [3] the scatterplot splits into two clearly distinct pixel distri- 
butions. A similar pattern appears in the diagnostics of radiative 
MHD simulations of Shelvag et al. (2004), where the upper dis- 
tribution is shown to be associated with strong flux concentra- 
tions, whereas the lower one corresponds to weakly magnetized 
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Fig. 1. G-band image of NOAA 0669 at (yu) = 0.97 recorded on 7th September, 2004. The dashed Hnes outline the spot and pore 
areas masked out for the segmentation. The "QS Reference" indicates the quasi-quiet Sun subfield chosen as reference for the 
contrast. The "Plage" subfield is the one selected for the Cc vs. Cc diagram shown in Fig[3] The arrow indicates the direction of the 
closest limb. 
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[arcsec] 

Fig. 2. G-band image of NOAA 0671 (fj.) = 0.6 recorded on 8th September, 2004. The "Plage" area was used for the Co vs. Cq 
diagram plotted in Fig.[3j 
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Table 1. Dataset specifications: //max) indicates the ju coverage of the images, after their rotation along the disk radius vector 
pointing towards the closest limb. FOVgff is the effective field of view once the spots and pores have been masked out. The number 
of image pairs selected for processing and analysis are given in the last column. 



Date 


NOAA 






FOVeff [arcsec^] 


Number of Pairs 


07-Sept-2004 


0669 


0.97 ± 0.003 


(0.963, 0.977) 


2034 


1 


07-Sept-2004 


0671 


0.78 ± 0.008 


(0.75, 0.8) 


2135 


1 


08-Sept-2004 


0670 


0.97 + 0.003 


(0.963, 0.976) 


1906 


1 


08-Sept-2004 


0667 


0.937 ± 0.003 


(0.928, 0.945) 


1027 


2 


08-Sept-2004 




0.9 + 0.005 


(0.882, 0.916) 


2400 


1 


08-Sept-2004 


0671 


0.63 ± 0.01 


(0.58, 0.67) 


1923 


3 


08-Sept-2004 


0671 


0.6 ± 0.01 


(0.55, 0.64) 


1763 


2 


08-Sept-2004 


0671 


0.56 ± 0.01 


(0.51,0.6) 


1904 


1 


13-Aug-2006 


0671 


0.77 ± 0.008 


(0.763, 0.776) 


566 


1 



granules (see also lSanchez Almeida et al ] l2001l for the compar- 
ison of different ID LTE atmospheres). We can thus select pix- 
els which are G-band bright and likely to be of magnetic origin 
by imposing two thresholds: a G-band threshold Cc.t selecting 
the bright portion of the diagram (dashed lines in Fig. [3]), and a 
threshold Cd ff.t on the contrast difference Cdiff = (Cg - Cq) (see 
iBereer et al.ll 19981 for more details). 

To achieve point 2, we chose a set of closely-spaced MLT 
levels between Cct and Cp - 0-7. The interlev el spacing was 
tuned to 0.02 (similar to lBovelet & Wiehijl2007l for BPs at disk 
center) by visual comparison of the segmentation maps with the 
original images. This spacing allowed chains of BPs and faculae 
striations to be resolved, while avoiding over-segmentation. The 
structures were then extended down to Cg = with two interme- 
diate levels at Cq = 0.1 and Co - 0.05. This extension increases 
the segmented area of faculae compared to BPs, allowing its fur- 
ther use as discriminant parameter (see Sect. 13.3b . The interme- 
diate levels prevent the merging of BPs with adjacent granules 
and the clumping of granular fragments when the contrast of in- 
tergranular lanes does not drop below Co = 0. Since a necessary 
condition for a feature to be selected is to have its contrast max- 
imum above Cc,t, no other levels were included between Cq = 
and Cc.t to avoid oversegmentation. Likewise, structures of less 
than 5 pixels in area (corresponding to the area of a roundish fea- 
ture with a diameter of 0''1, i.e. roughly equal to the diffraction 
limit) were removed at each MLT level. 

The segmentation algorithm then proceeded in two steps: 
First, MLT was applied to the spatially-filtered G-band images. 
In a second step, structures corresponding to "magnetic" fea- 
tures were selected by requiring them to contain a minimum 
of 5 pixels satisfying Co > Cc.t and Cdiff > Cdiff.t- A binary 
map of segmented features was ultimately obtained for each G- 
band/continuum image pair 

From Fig. |3] one notices that the "magnetic" and "non- 
magnetic" pixel distributions overlap more at (//) - 0.6 than 
at {fi} - 0.97, a tendency that was generally observed for de- 
creasing (yu). To avoid the false detection of granules, the G-band 
threshold must then be raised as (jj) decreases, inasmuch as the 
"magnetic" pixel distribution extends towards larger values of 
Cg while the "non-magnetic" one reaches lower values (as the 
rms contrast of granules decreases towards the limb). To do this 
consistently, we determined a CLV of maximum G-band con- 
trasts Co.max of features segmented independently of the G-band 
threshold. Specifically, taking one image pair at each disk po- 
sition, the features were segmented solely by a safe difference 
threshold Cdiif,t = 0.1, and the ones having fewer than 20 pix- 
els above this threshold were remo ved (as most non-ma gnetic 
detections contain only a few pixels. iBerger & Titlell200ll) . A vi- 



sual count over a portion of the images yielded an estimate of 
the remaining fraction of false detections, approximatively 4 %. 
Under the reasonable assumption that these false detections were 
also the faintest, we adjusted the G-band threshold consistently 
at all (jj) to eliminate the ~ 4 % faintest features (the chosen 
value of the threshold was actually rounded up, and was con- 
stant throughout the field of view at each {fi}). The values of the 
features maximum G-band contrast Cc.max as well as the (yu)- 
dependent G-band thresholds are plotted vs. (//) in Fig.|4] 

Unlike the G-band threshold, the difference threshold Cdift.t 
can be set to a unique value for all disk positions, inasmuch 
as the "non-magnetic" distribution has a slope roughly equal to 
unity at all (ju). To set Cdiif,t properly, we made use of "test" 
data consisting of a single G-band/continuum image pair ob- 
tained with the same setup as our original dataset (and pro- 
cessed as in Sect. 12.11 except for speckle reconstruction), but 
supplemented with SOUP (Lockheed Solar Optical Universal 
Polarimeter) Stokes V and / maps, recorded in the wing of the 
Fe I 6302.5 A line with a detuning of 75 mA. Given the value of 
the G-band threshold for that disk position, the difference thresh- 
old was tuned such as to minimize the fraction of "false" de- 
tections. By considering false detections as having less than 5 
pixels with \V/I\ > 0.075 (well above the noise level ~10"^), the 
optimal difference threshold was found as Cdiff.t = 0.08. The cor- 
responding fraction of false detections amounts to roughly 2 %. 
These test images were, however, not used further because they 
were focused on a large sunspot and hence contain a very small 
effective field of view (see Table[T] 13-Aug-2006). 

Without information about the magnetic field itself, our seg- 
mentation has to rely on purely photometric thresholds, and 
hence cannot detect all the magnetic features. The combined 
thresholds only aim at detecting a sample of bright features 
that is least biased by non-magnetic ones. However, the use of 
thresholds always implies the drawback of selection effects. In 
particular, the G-band threshold will neglect fain ter features, es- 
pecia l ly low-contrast BPs near disk center (see Title & Bergeil 
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3. Discriminant analysis of bright points and 
faculae 

3. 1 . General scheme and training set 

To develop an algorithmic classification method for BPs and 
faculae, we adopted the following scheme, that uses Linear 
Discrimi nant Analysis (LDA, a statistical technique first intro- 
duced bylRsche^QSl on a reference sample of features: 
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Fig. 3. Cg vs. Cc scatterplots for selected plage subfields of area approximately 10 x 12 arcsecs^ at (fj.) = 0.97 (left) and (fj.) = 0.6 
(right). The exact locations of these subfields within their respective images are outlined in Figs. [T] and |2] (rectangles denoted 
"Plage"). The solid line corresponds to the difference threshold Cdiff,t and the dashed line to the G-band threshold Cc.t. 
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Fig. 4. Maximum G-band contrast values (small crosses) of all 
features segmented only with a difference threshold (CdifF.t - 
0.1) at each disk position. The dashed line corresponds to 
the chosen G-band threshold removing approximately the 4% 
faintest features, which are dehmited exactly by the large 
crosses. 



classifi cation can be found in lMurtagh & Heci3 (Il987h and lHandl 
(IT98T1) . 

Our training set was chosen as a sample of 200 BPs and 200 
faculae, obtained by manual selection of 40 features of each at 
each of five disk positions: {(//) = 0.56, 0.63, 0.78, 0.9, 0.97| for 
faculae and = 0.63, 0.78, 0.9, 0.94, 0.97) for BPs. Because it 
is used as a reference for the classes, the selected sample should 
be statistically representative of the actual populations of BPs 
and faculae (such as would be identified by eye). At each disk 
position, care was thus taken to select the most homogeneous 
mixture of features with various contrasts and sizes, distributed 
over the whole field of view. 

It should be kept in mind that BPs and faculae are possibly 
not two distinct types of objects, but the radiative signatures of 
more or less similar physical entities (magnetic flux concentra- 
tions) viewed under different angles. Consequently, there may 
well be no sharp boundary between the two classes, but rather a 
continuous transition with a spectrum of "intermediate features", 
having various degr ees of "projection" onto th e adjacent limb- 
ward granules (see iHirzberger & Wiehj I2005L and Sect. I3.2l i. 
The concept of classes can nonetheless be introduced to repre- 
sent the populations of features that would be reasonably identi- 
fied as BPs and faculae upon visual inspection, but the approach 
proposed here cannot claim to classify the intermediate features 
mentioned above. 



1. Training set selection: Extraction of a reference sample of 
features, visually identified as BPs and faculae. 

2. Discriminant parameter definition: Choice of observables 
taking sufficiently different values for the BPs and faculae 
of the training set, in order to be of use for the further dis- 
crimination of the rest of features. 

3. LDA: Determination of a unique variable by linear combina- 
tion of the chosen parameters, such that it best discriminates 
between the two classes of the training set. 

4. Assignment rule: Imposition of an adequate threshold on the 
discriminant variable defined by LDA, separating the BPs 
and the faculae of the training set. 

Ultimately, all the magnetic brightenings detected by the seg- 
mentation algorithm can be classified according to the assign- 
ment rule, by measuring their value of the variable defined by 
LDA. Comprehensive manuscripts about the general topic of 



3.2. Characteristic profiles 

As a basis to define discriminant parameters, we considered the 
spatial variation of G-band contrast along a cut made through a 
BP or a facula. Small magnetic features are indeed known to ex- 
hibit more pronounced signatures in the G-band than in contin- 
uum, and such contrast profiles have characteristic shapes when 
BPs and faculae are cut along specific directions: radially for 
limb faculae, and across the intergranular lane for disk center 
BPs (Berger et al. 1995; Hirzberger & Wiehr 2005). 

The following procedure was developed to retrieve one char- 
acteristic profile per feature, independently of the feature type 
and disk position: First, each feature was oriented in a local co- 
ordinate frame x/y as illustrated in Fig. |5] The x/y axes were 
defined such as to minimize the y-component of the feature's G- 
band "contrast moment of inertia" Mcy = 'ZjCG{x,y){x- x^ax)^, 
where jCmax is the x-location of the contrast maximum Ccmax- 
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To give optimal results on the orientation, the summation only 
ran over pixels having Cc > 0.5 Ccmax, thus involving only the 
"core pixels" of the feature^. In practice, the minimum of Mc,y 
was found by iteratively rotating a small window surrounding 
the feature with 5° steps (smaller steps did not yield better re- 
sults, due to the finite number of pixels considered). Next, con- 
trast profiles were obtained along x and y by averaging the rows 
and columns of that window having pixels with Cc > 0.9 Ccmax 
(delimited by black lines in Fig. Such x/y profiles are dis- 
played in Fig.|6]in the case of a typical BP (right) and a typical 
facula (left). These profiles were further restricted to the contrast 
range Cc > about Cc.max (delimited by the lower marks), 
such that all profiles share a consistently-defined reference level 
Cg = 0. Finally, the single characteristic profile for each feature 
was found to be the smoothest of the positive contrast-restricted 
x/y profiles (overplotted in thick). To quantify the smoothness of 
the profiles, we counted the number of their local extrema, even- 
tually adding the number of inflexions if the number of extrema 
was equal in x and y. The use of MLT segmentation (as opposed 
to a single-clip) is an essential prerequesite for obtaining these 
characteristic profiles, by avoiding that pixels from adjacent fea- 
tures contaminate the contrast moment of inertia and thus spoil 
the feature's orientation process. 



(o) 



(b) 



(c) 




Fig. 5. Orientation of an individual feature in its local x/y coordi- 
nate frame, a) Zoom window surrounding the feature in the orig- 
inal G-band image. The cross indicates the location of the con- 
trast maximum, b) Isolated feature as delimited by the segmen- 
tation map. The pixels having Cg > 0.5 Ccmax are highlighted 
in grey, and are used to compute the G-band contrast moment 
of inertia, c) Window rotated such that the y-component of the 
G-band contrast moment of inertia Mc y is minimum, thereby 
defining the local x/y coordinate frame. The rows and columns 
used to retrieve the average profiles along x and y are contained 
between the straight black lines. 

Owing to the previous orientation of the features, the charac- 
teristic profiles exhibit different shapes for BPs and faculae, and 
consequently proved very useful for the extraction of valuable 
discriminant parameters (see Sect. 13.31 ). In contrast, profiles re- 
trieved along the disk radius vector (as performed in early stages 
of this work) have less characteristic shapes and thus less power 
to distinguish BPs from faculae, due to the scatter in the orien- 
tation of these features with respect to the radial direction. As 
can be seen in the examples of Fig. |6] the characteristic profile 
of the typical BP is narrower and steeper than the profile of the 
typical facula. In particular, the characteristic profile of the fac- 
ula is indistinguishable from the adjacent granule, as the contrast 
varies monotonously from one to the other We mention the re- 
semblance of the characteristic profiles of the BP and facula to 



the ob servations of lBerger et al.l (Il995h and lHirzberger & Wiehil 
(2005), respectively, as well as with the synthetic profiles of 
Knolker et al. ( 1988) and Steiner (2001). 

Due to finite resolution, straylight, and the partial compensa- 
tion of spatial intensity fluctuations by the filter (see section l2Tb . 
it is com mon to find BPs embedd ed in "grey" lanes with positive 
contrast (Bovelet & Wiehrll2007h . Upon careful visual analysis 
of grey lane-BPs profiles, we identified these grey lanes as con- 
trast depressions with a low minimum (CG.min ^ 0.1), separating 
the BP profile from the adjacent granule profile. As most normal 
BPs profiles have quasi-linear slopes at their edges, the sides of 
profiles featuring grey lanes were linearly extrapolated down to 
Cg = 0. Only after this could the x and y average profile be prop- 
erly restricted to positive contrast values, and their smoothness 
compared for the adequate retrieval of the characteristic profile. 
This linear extrapolation is illustrated in Fig. [7] for characteris- 
tic profiles of both BPs and "intermediate features", indicating at 
the same time the variety of feature profiles that can be obtained. 



o 




' Involving pixels with lower contrast yields poorer results, as these 
are often associated with granulation in the case of faculae, and thus do 
not carry information about the orientation of the facular brightening 
itself 











o 






„o 


-0 
-0 




o 

o 

-0 



Fig. 7. Characteristic profiles (thick lines) of BPs (a, b) and in- 
termediate features (c, d) surrounded by one or two "grey" lanes. 
On the grey lane sides, the characteristic profiles have been lin- 
early extrapolated from the half-max level ("H-") to Cg = 0. 
From a) to d), these profiles illustrate the continuous transition 
between the typical BPs and typical faculae, for which examples 
are shown in Fig. |6] 



3.3. Discriminant parameters 

In search of adequate discriminant parameters, we carried out 
a pilot study by defining a set of parameters. These included 
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Fig. 6. Local frame orientation and G-band contrast profiles and of a typical facula {left) and a typical BP {right). Top windows: 
Orientation of the features in their local xjy coordinate frames, where the black lines delimit the pixels having Cq > 0.9CG,max used 
for profile averaging. The white contours obtained from the segmentation map enclose the area A of the features. Lower panels: 
Average G-band contrast profiles along x and y. The retrieved characteristic profile for the BP and the facula is indicated by the 
thick lines. The "+" marks intersecting the reference level Cg = (dashed line) delimit the positive contrast portion of the profiles, 
and the upper "+" marks indicate the half-max level Cg - O.SCc.max on the characteristic profiles. The parameters A and V are 
illustrated on the characteristic profiles of the facula and the BP, respectively. All the x/y profiles were cubic spline-interpolated by 
a factor of 10, in order to avoid artificial roughness due to sampling when choosing the characteristic profile (as the smoothest of 
the x/y profiles, see Sect. 13.2b . 



the peak-to-width ratio, area asymmetry and second moment of 
the characteristic profiles, the loc al contrast relative to th e im- 
mediate surroundings (similar to iBovelet & WiehH l2003h . and 
the contrast of adjacent lanes. By looking at the distribution of 
the parameter values for the BPs and faculae of the training set 
(mean values and standard deviation at each (p), see below) as 
well as their correlation, three roughly mutually independent pa- 
rameters were eventually found to be good discriminants for the 
training set classes. Their definitions are illustrated in Fig.|6l 

- A := width of the characteristic profile at the reference level 
Cq -0 [arcsecs]. 

- V := average slope (from both sides) of the characteris- 
tic profile below the half-max level Cg.hm = 0.5CG,max 
[arcsecs"']. 

-A: - apparent area (projected onto the plane of the sky) of the 
feature defined by the segmentation binary map [arcsecs^]. 

We emphasize that the three chosen parameters are defined using 
relative contrast levels (Co = and Cg = O.SCc.max), allowing 
the comparison and classification of features having different ab- 
solute contrast values (notably due to the CLV of contrast). 

Fig.|8](left column) shows the mean values and standard de- 
viations of the parameters A, A and V at the yu-values of the train- 
ing set. These parameters describe well the different appearances 



of BPs and faculae for the following reasons. The best discrim- 
inant parameter. A, takes greater values for faculae as it encom- 
passes the width of the adjacent granular profile (as the facular 
and granular profile are merged together, cf. Fig.|6]l, whereas BPs 
are limited to the width of intergranular lanes. The parameter V 
describes how steeply the contrast drops towards the edges of the 
profile and typically has larger values for BPs, which show steep 
and symmetric contrast enhancements squeezed between the ad- 
jacent granules. To supplement these two profile parameters, the 
segmented feature area A has been added to avoid that faculae 
with small widths (typically lying on small abnormal granules 
frequently found in active regions) are classified as BPs. In area 
these faculae appear significantly larger. 

As can be seen from Fig. [8] the parameter values do not 
vary significantly with (p), as the difference between the largest 
and smallest mean values over the whole p range barely exceed 
their standard deviations. The relative constancy of width and 
area is particularly surprising for faculae, and could be due to 
a compensation of granular foreshortening by enhance d radia- 
tive escape in the direction toward the flux concentration ( Steinej 
2005), as well as to the distribution in the orientations of faculae 
(to be discussed in a forthcoming paper). The relative invariance 
of the parameters is nevertheless advantageous, as it justifies per- 
forming LDA on the whole training set at once (all (p) together), 
thus allowing us to find a single linear combination of parame- 
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Fig. 8. Left column: Mean values and standard deviations of the three parameters for the BPs ("o" and thin bars) and faculae ("A" 
and thick bars) of the training set vs. disk position (/i). The training set does not contain BPs at (p) - 0.56 or faculae at (yu) = 0.94. 
To give an idea of the outliers and the // distribution of the training set features at each disk position, individual features values are 
overdrawn (small "o" for BPs and small "A" for faculae). The values of the individual features were computed by using the (fi) 
(center of FOV) of the corresponding images as reference. Right column: Normalized density functions (DFs) histograms of the 
log-transformed parameters for BPs ("o") and faculae ("A"), obtained by combining all disk positions of the training set together. 
Cubic splines are overplotted for clarity and their maxima were used for the normaUzation of the histograms. 



ters and a single BPs/faculae threshold valid for all the disk po- 
sitions of our dataset. Moreover, combining all the training set 
features enhances the sampling of the classes and yields a more 
accurate threshold. 

It should be noted that the values of the parameters A, A and 
V depend to some extent on spatial resolution. This points to the 



necessity of having a dataset of roughly constant resolution, a 
condition met by our selected image pairs (see Sect. 12.1b . 

3.4. Linear Discriminant Analysis 

Because LDA distinguishes classes based solely on means 
and covariances (see Equation [U, it works best for param- 
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eters that are normally or at least symmetrically distributed 
(iMurtagh & Hecklll987 l). To verify this condition, we studied 
the density functions (DFs) of our three parameters by produc- 
ing histograms of the training set, an d estimated the skewnesse s 
via the third standardized moment dKennev & Keepindll962h . 
Taking the natural logarithm was foun d to reduce the skewness 
of all parameters (Limpert et al.'200l"), and therefore they were 
replaced by their log-transformfl The DFs of log(A), log(A) and 
log(V) are displayed in Fig.[8](right column). 

Having the correct parameters in hand, LDA could then be 
caiTied out to find their linear combination that best discrimi- 
nates the training set classes. Explicitely, we searched for the 
axis vector 'a that maximizes Fis cher's separab ility criterion (as 
introduced in the original work of lFischerll936h in the parameter 
space {x = (log(A),log(A),log(V))): 



7(a) = 



[a'^(iMbp - muc)f 
a^(S bp + 5 fac)a 



(1) 



where the superscript T denotes transpose, »ibp and rrif^c are the 
class mean vectors and 5'bp,5'fac the covariance matrices. The 
original parameters could then be projected onto o^, thereby ob- 
taining the desired linear combination defining the single vari- 
able F = a X (for "Fischer" variable). The 2D projections of the 
3D training set vectors are represented in Fig.|9^-c, with an over- 
laid axis coiTesponding to the direction of maximum separability 
'a. Fig.|9}l displays the DF of the obtained variable F. The max- 
imal value of J associated with the variable F (projection onto 
fl) and the values of J associated to each parameter (projection 
onto the parameters' axes) are listed in Table |2] These values 
give an idea of the relative "discriminant power" of the three pa- 
rameters, and the larger J value of the variable F demonstrates 
the advantage of their optimal linear combination provided by 
LDA. 

The DFs of the discriminant parameters (Fig.[8]right column) 
and of the variable F (Fig.|9}l) also give a good visual estimate 
of the amount of overlap between the training set classes. An in- 
tuitive measure of the discriminant power of a parameter could 
then be given by the ratio between the number of features con- 
tained in the overlapping part of the DFs and the total number of 
training set features. This ratio already takes a fairly low value 
of 0.07 for log(A), and goes down to 0.042 for F. However, such 
a measure is statistically poor compared to J, since it mostly 
relies on the outliers contained in the tails of the DFs (only 28 
and 17 features for log(A) and F, respectively), whereas J takes 
advantage of the full parameter distributions. 

The overlap of the DF of log(A) and the particular skew- 
ness of the DF of faculae towards small areas arises from our 
MLT segmentation. To investigate the influence of the MLT lev- 
els on the DFs of the dicriminant parameters and on LDA, we 
carried out tests with fewer MLT levels over the same training 
set. We found that the skewness of the DF of log(A) for faculae 
is in major part due to their segmentation into fine striations. It 
should be noticed that the DF of log(A) is less skewed, because 
the characteristic profiles of these striated faculae are mostly re- 
trieved along the long dimension of the striations (owing to their 
individual orientation, see Sect. 13.21 ). which makes A a robust 
parameter to distinguish them from BPs. However, the coarser 
segmentation of the tests has the undesired effect that a part of 
the features are undersegmented, which leads to lower values of 
J for all parameters as well as for the discriminant variable F. 



^ This was partly expected, as the width of magnetic bright poi nts has 
been observed to be log-normally distributed fflerger et aljl99^ . 



Table 2. J values associated with each discriminant parameter, 
and for the variable F obtained by linear combination of the 
three parameters. The a's are the coefficients of the linear com- 
bination (absolute values). 





log(A) 


log(A) 


log(V) 


F(3D) 


J 


3.17 


4.71 


1.8 


6.27 


a 


0.27 


0.59 


0.14 





Due to the merging of BPs into chains and ribbons, their DFs 
are particularly affected and become skewed towards larger A, A 
and smaller V. For A and V, this is probably a consequence of the 
misorientation of merged features when retrieving the character- 
istic profiles. We believe that those tests confirm our appropriate 
choice of MLT levels for the purpose of further discriminating 
between individual BPs and faculae. 

We stress that the procedure of orienting the features prior to 
the retrieval of their contrast profiles, as described in Sect. 13. 21 is 
an essential ingredient to obtain discriminant parameters based 
on those profiles. In early stages of this work, profiles were only 
retrieved along the direction perpendicular to the closest limb, 
and the ensuing overlap of the DFs was significantly larger. 

Finally, it should be noted that the values of these photomet- 
ric discriminant parameters all depend to some extent on the spa- 
tial resolution. V is probably the most sensitive in that respect, 
but it has the least weight in the variable F due to its lower value 
of J (see Table This points to the requirement of having a 
dataset of roughly constant resolution, a condition met by our 
selection of images (Sect. lTTI ). 

4. Classification results and discussion 

4.1. Hard threshold vs. reject option 

To build an assignment rule, we made the usual choice of 
a threshold valu e Ft at eq ual "standardized" distanc e from 
the class means (lMahalanobis..l936: .Murtagh & Heck.il 987h . 
namely: 



(a^jMbp - Fif 



a 5faca 



(2) 



This threshold is drawn on the density function histogram of F 
inFig.HJl. 

In a first step, a single hard threshold equal to Fi was used to 
classify all the segmented features as BPs or faculae by measur- 
ing their values of F. Because we are not interested in absolute 
numbers, but rather in the relative distribution of BPs and facu- 
lae, we define the classified fractions of BPs and faculae as: 



^bp = 



A^bp 



A^bp + A^fa, 



^fac = 



A^bp + A^fa< 



(3) 



where A^bp and A^fac are the number of classified BPs and faculae, 
respectively. The CLV of the fractions Xbp and Xfac, classified 
using the threshold F^, is depicted in Fig.fTOh. 

As already stated at the end of Sect. 13.11 there is a continu- 
ous spectrum of intermediate features between BPs and faculae 
that cannot be reasonably identified as belonging to one class 
or the other. The only way to avoid the erroneous classification 
of these features is to exclude them from the statistics by intro- 
ducing a so-called "reject option" (Hand 1981), in the form of 
a rejection range in F centered about Fi. Assuming that all in- 
termediate features fall within the rejection range, the relations 
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Fig. 9. (a-c) 2D projections of the 3D training set vectors {log(A), log(A), log(V)} for the BPs ("*") and faculae ("A") classes of 
the training set, together with an axis indicating the projected direction of maximum separability 'a (dashed-dot line), d) Spline- 
interpolated density function histogram of the variable F, with the BP-faculae threshold F, (long dashed line) and the rejection 
range corresponding to the apparent rejection rate a = 0.2 (grey zone). 



between classified and true numbers A^bp*, A^fac* (such as would 
be recognized by eye) at each disk position are: 



A^bp = A?bp*(l -/?*) + e*Nf,,* - eNbp\ 



ff'A^fac* +/3*A^bp* +A^int, 
Mot = A^bp + A^fac + A^rej = A^bp* + Mac* + Mn 



Mrej = 



(4) 

(5) 

(6) 
(7) 



where a* and /3* stand for the "true" rejection rates, i.e. the frac- 
tions of the actual faculae and BP populations that fall in the 
rejection range, and e*, represent the "true" misclassification 
rates. The boundaries of our rejection range were tuned to have 
equal "apparent" rejection rates, a - /3, defined as the fractions 
of the training set contained in that range. Assuming that the 
training set adequately represents the true BPs and faculae pop- 
ulations, namely a* ~ a and /3* ~ /3, this precaution equalizes 
the true rejections as well (a* ~ jj*), and thus prevents the intro- 
duction of bias in the number statistics. The apparent rejection 
rate was set to a reasonable value for the rejection of intermedi- 
ate features, a - 0.2 (see below), yielding the shaded rejection 
range on the histogram of F in Fig.|9}J- 

Under the full rejection of intermediate features, the classi- 
fied fractions then become: 



^bp - 



Mbp*(l-y6*) + e*Mfac*-gAfbp* 
Mbp*(l-/3*) + Mfac*(l-«*) 



(8) 



for BPs and similarly for faculae. Note that if the training set is 
representative and the misclassification rates can be considered 
negligible, relation|8] further simplifies to X^^ ~ X^^, where X^^ 
stands for the true fraction of BPs, namely N^^/(N^^+N J (same 
remark for faculae). The CLV of classified fractions Xbp and Xfac, 
obtained by adding a rejection range with a = 0.2, is shown in 
Fig.Mp. 



Compared with the CLV of X\,p and Xfac obtained with a hard 
threshold, the difference between Xbp and Xfac at each (//) is now 
systematically larger (except for (fi) - 0.9). This is probably 
an effect of the contamination by intermediate features in the 
hard threshold case, because such features are assigned roughly 
equally to each class, so that they have the tendency to equalize 
Xbp and Xfac. The difference between X^p and Xfac is particu- 
larly large for (fi) = 0.97 and for the limbward data points at 
ifi) < 0.64. This is likely to be attributed to the larger misclassi- 
fication errors in the hard threshold method than with the reject 
option. Indeed, as can be seen from relation (|4]i, the misclassifi- 
cation errors also tend to overestimate the number of BPs near 
the limb where faculae dominate, while underestimating it near 
disk center, and vice versa for faculae. To estimate the true mis- 
classification rates, we computed the apparent misclassification 
rates e, ^ by reclassifying the training set, assuming as before 
that the latter adequately represents the true populations. With 
the chosen rejection range, we obtained e, ^ = 0.005, which is 
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an order of magnitude smaller than in the the hard threshold case 
and should be reflected in the true rates as welQ. 

We shall now elaborate on the validity of the aforementioned 
assumptions, and further justify the use of a reject option as op- 
posed to the hard threshold classification. A subtle source of er- 
ror is the departure of the actual populations from the training 
set ones, causing a* + p* . To evaluate the importance of this 
effect, we have varied a in the range (0.2, 0.5), which should in 
that case induce unequal variation of a* and fi* and consequently 
different variations of Xbp and Xfac (cf. Eq. [8]). By the same to- 
ken, this allowed us to check if intermediate features were still 
wrongly classified as BPs or faculae for a - 0.2, as the sep- 
aration between X^^ and Xfac should then increase with a. But 
instead, we observed both positive and negative fluctuations of 
Xbp and Xfac, indicating that most intermediate features were in- 
deed rejected, and that the true rejection rates were nearly equal 
for BPs and faculae. As those fluctuations were always less than 
0.05, we chose this value as an upper limit on the error induced 
by uneven actual rejection rates, and represented it by the sym- 
metric error bars in Fig.fTOb. To compare the effect of rejection 
at the various disk positions, we overplotted in Fig. [TOb the frac- 
tion of rejected features with respect to the total number of fea- 
tures Ni-sj/Niot- The relative constancy of A^rej/Mot is reassuring, 
and reflects the self-similarity of the actual BPs and faculae pop- 
ulations at various {fj.} (as far as F is concerned). It also gives 
an indication about the number of intermediate features, as in 
absence of them we would have A^rej /Mot - cf, using as before 
a* ~ p* ~ a together with the relation (|6|l. As can be seen, 
the fraction of rejected features fluctuates around 0.4, indicating 
a significant fraction of intermediate features Mnt/Mot - 0.2, 
which further justifies the introduction of the rejection range. 

From a methodological point of view, despite the qualitative 
resemblance of the CLVs obtained with the hard threshold and 
with the reject option, the first method is open to criticism due 
to the large amount of intermediate features. A hard threshold is 
in this sense self-contradictory, as it tries to assign these features 
to classes whose reference (the training set) does not represent 
them. By contrast, if these features are properly rejected, and if 
the assumptions of negligible misclassification rates and repre- 
sentative training set {a* ~ /3*) are justified, the reject option 
method has the elegance that the classified fractions closely ap- 
proximate the true ones. 

Lastly, the results obtained here do not only depend on the 
choice of the training set, but also on the choice of the classifica- 
tion method. Fischer's LDA implicitely assumes similar covari- 
ance matrices for the classes, which is not quite true in our case, 
as can be seen from the different shapes of the BPs and faculae 
"clouds" in their 2D projections (Fig.|9l). We then implemented 
a "class-dependent" LDA, taking into account the difference in 
covariance matric es and deriving differen t discriminant axes for 
the two classes (B alakrishna ma et al. 1999). However, the differ- 
ence in the relative fractions obtained was insignificant, thereby 
indicating that the covariance matrices of our chosen parameters 
were suitable for Fischer's LDA. 



As these are only optimistic estimates fo the true misclassifica- 
tion rates, wejmgleniented^ method on the training 
set ( lHanclll98il : lMurtagh & Hecldll987h . which nevertheless gave the 
same results as the simple reclassification (probably due to the fairly 
large size of the training set). 



4.2. Discussion 

To assess the validity of the proposed method, the obtained re- 
sults can be compared with the observations of BPs and faculae 
at various /u available in the literature. To help the comparison as 
well as to give a visual idea of which features were classified as 
BPs and faculae, Fig.[TT] Fig.fT2land Fig.[T3]show their contours 
overlaid on the G-band images at (yu) = 0.97, (yu) = 0.6 (same 
images as in Figs. [T] and |2]i, and (fi) - 0.9 respectively. Our 
result s are consist e nt wit h recent high-resolution observations 
from iBerger et al.l (l2007l) . who noticed the presence of disk- 
center faculae at (//) - 0.97, very few intergranular BPs at {jj} = 
0.6, and a mixture of both features at (fi) = 0.89. The presence 
of some "intergranular brightenings" arou nd (u) - 0.55 also 
has be en reported bV iLites et al. ( 2004,) . and lHirzberger & Wiehrl 
(l2005h have clearly observed the coexistence of BPs and facu- 
lae at yu ~ 0.78. This suggests the validity of our classification 
method, although this should be confirmed in the future by using 
datasets with co-temporal magnetic vector information. 

Although our CLV cannot be generalized due to the limited 
statistics of the present dataset and the coarse sampling of the 
fi range, it allows us to constrain the fi interval where the tran- 
sition from BPs to faculae occurs. In this respect, the CLV also 
exhibits a plateau in the range 0.6 < yu < 0.78, indicating that 
BPs may still be found in that range, but progressively disappear 
closer to the lim b, probably affected by the foreground granu- 
lar obscuration (lAuffret & Mulleilll99lh . This plateau can also 
be attributed to the slower variation of the heliocentric angle in 
that yu range (36° < 6 < 54°) compared to the centerward range 
0.78 < yu < 0.97 (13° < e < 41°). Conversely, faculae appear 
to be present at all disk positions, except for the inner third of 
the disk where jj > 0.9. Hence, in contrast to full-disk images in 
which faculae patches are only prominent closer to the limb, at 
high resolution faculae are conspicuous features of active-region 
plages at all disk positions. 

The overall dominance of faculae in our dataset as well as 
the presence of BPs at relatively small yU values (yU ~ 0.6) cannot 
be understood in terms of the conventional "hot wall" picture, 
if we consider only vertical flux tubes and varying viewing an- 
gles with disk position. The most straightforward alternative is 
to invoke inclined fields (e.g. due to swaying motions), whereby 
BPs would arise from flux tubes aligned along the line of sight 
and faculae from flux tubes inclined with respect to it. Again, 
such an hypothesis should be verified with the help of magnetic 
vector data. 

From here on, we discuss the assets and weaknesses of the 
proposed method, as well as its applicability. A key point of the 
method resides in the orientation of individually segmented fea- 
tures so as to retrieve characteristic profiles (see Sect. 13.2b . This 
procedure makes the method applicable at different (jj) (due to 
the diverse orientations of BPs and faculae), and thereby offers 
the possibility of studying the transition from BPs to faculae as 
fi varies. In the considered fi range at least, the orientation pro- 
cess makes the discriminant parameters roughly yu-invariant, thus 
allowing LDA to be applied to the whole dataset at once (cf. 
Sect. 13.3b . LDA itself has the advantage of being fairly simple 
to implement, and of making few assumptions on the distribu- 
tion properties of the discriminant parameters (see Sect. 13.4b . It 
nevertheless requires the careful preselection of a training set, 
a crucial step that can potentially bias the classification. But 
the principal weakness of the current method lies in the use of 
photometric information only, allowing a limited number of dis- 
criminant parameters to be defined. This induces the following 
pitfall: faculae "sitting" on very small granules (fragments, ab- 
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Fig. 10. a) CLV of relative fractions of BPs ("o") and faculae ("A") obtained by classifying all the segmented features using a hard 
threshold Ff defined via equation (|2]). The abcissa of the points are the (yu) of the analyzed images, and the horizontal error bars 
correspond to the standard deviation of the individual features' ju values with respect to (ju). A^tot gives the total number of features 
at each disk position, which depends on the effective FOV of the images and the number of available image pairs at each disk 
position (see Table [T]l. The dotted lines are guides, b) Relative fractions of BPs ("o") and faculae ("A") after the introduction of a 
rejection range such that the apparent rejection rates of BPs and faculae are equal to 0.2. The vertical error bars (+5%) are upper 
limit estimates of the errors induced by uneven true rejections rates (see main text for details). The fraction of rejected features with 
respect to the total number of features is also plotted, as squares ("□") joined by a dashed line. 



normal granulation) are basically indistinguishible from BPs as 
far as our parameters are concerned. Several instances appear 
in Figs.[T2]and[T3] where such small faculae are either rejected 
or misclassified. The method could be improved by the inclusion 
of discriminant parameters coming from polarimetric maps, pro- 
vided that BPs and faculae exhibit sufficiently different magnetic 
properties. 

We conclude by drawing attention to precautions that should 
be taken in applying our method to other datasets. Having a fairly 
homogeneous and high spatial resolution is an essential require- 
ment, as the values of all parameters A, A and V depend on it. 
A variable resolution would cause the values of the parameters 
to vary throughout the dataset (between different images or even 
accross the field of view), thus preventing a well-defined dis- 
criminant variable and a unique BP/faculae threshold from being 
obtained. For a dataset with a constant but different resolution, 
the method would in principle still be valid, but the values of 
the parameters and of the class threshold would differ. However, 
degrading datasets of variable resolution to a constant lower one 
would reduce the contrast of features (loss of statistics due to 
the contrast threshold), and blur the local contrast depressions, 
so that it would become difficult to separate adjacent BPs and 
faculae striations. The method would also lose in efficiency due 
to the misorientation of merged features (cf. Sect. l3.2l i. Finally, 
care should be taken in applying unchanged the herein-derived 
discriminant F and its threshold value to other datasets. If the 
current method is applied to a dataset of slightly different res- 
olution (or with a d ifferent amount of straylight), wavelengths 
(e.g. CN-band, iZakharov et ani2007D or II range, the values of 
the segmentation thresholds should first be adapted (the same 
holds for the identification criteria of the "grey-lane" BPs), and 
the training set selection must then be repeated (as well as the 
subsequent steps of the method thereof). Under different con- 
ditions, the ensuing values of the discriminant parameters will 
be different, and consequently LDA will yield a different linear 



combination for the discriminant variable and a different thresh- 
old. 



5. Summary and outlook 

We have developed a photometric method based on Linear 
Discriminant Analysis (LDA) to discriminate between individ- 
ual Bright Points (BPs) and faculae, observed at high resolution 
over a range of heliocentric angles. We first demonstrated the 
feasibility of an automated segmentation for both individual BPs 
and faculae at various disk positions, based on joint G-band and 
continuum photometric information only. For each segmented 
feature, a "characteristic G-band contrast profile" was retrieved 
along a specific direction, by properly orienting the feature using 
its "contrast moment of inertia". Three physical parameters were 
then identified to be good discriminants between BPs and facu- 
lae at all disk positions of our dataset: the width and slope of the 
contrast profiles, as well as the apparent area defined by the seg- 
mentation map. Linear discriminant analysis was then performed 
on a visually-selected reference set of BPs and faculae, yielding 
a single linear combination of the parameters as the discriminant 
variable for all disk positions. Using an appropriate threshold 
and rejection range on this variable, all the segmented features 
were ultimately classified and the relative fractions of BPs and 
faculae at each disk position of our dataset were computed. The 
resulting CLV of these fractions is mostly faculae-dominated ex- 
cept for fi > 0.9, i.e. close to disk center. This is in agreement 
with previous observations, thus suggesting the validity of the 
presented method. We propose that these ubiquitous faculae are 
produced by a hot-wall effect through inclined fie Ids. 

Using our classification method, we plan to present more 
statistical results concerning photometric properties of BPs and 
faculae (such as contrast and morphology) in a forthcoming pa- 
per. A similar classification study should in future also be per- 
formed on a high-resolution dataset with magnetic field vec- 
tor information, in order to determine the magnetic properties 
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of BPs and faculae separately and further validate the method. 
Such datasets can now be obtained from ground-based imaging 

J iolarimeters such as the GFPl (Puschmann et al. 2006), CRISP 
Scharmer et a l. 2008), or IBIS (Cavalhni 2006), and in the near 
future from the SUNRISE ( Gandorfer et al...2007.) stratospheric 
balloon-born observatory. Through their seeing-free quality, the 
images of SUNRISE will be very promising for the application 
of our method, as they will naturally satisfy its requirement of 
homogeneous spatial resolution. In particular, a comparison of 
the classification results with the field inclinations retrieved by 
Stokes profile inversions could be particularly interesting to ver- 
ify the hypothetical association of faculae with inclined fields. 
Ultimately, a similar classification should be performed on syn- 
thetic images computed from 3D MHD simulation boxes at var- 
ious angles. A comparison with the classification obtained from 
observational data would then give more physical insight into the 
relationship between BPs and faculae on one hand, and provide 
novel constraints for the models on the other hand. 
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Fig. 11. G-band image at (//) = 0.97 (the same as in Fig. [T]), with the overlaid contours of the features classified using the reject 
option. The faculae are contoured in green, the BPs in yellow and the rejected features in red. The contours coiTespond to the border 
of the features as defined by the segmentation map (corresponding to the lowest MLT level Cg = 0). Tickmarks are in arcseconds. 
The corresponding original image without contours is presented in Fig.[T] 
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Fig. 12. G-band image at (fi) = 0.6 (the same as in Fig.|2]i, with the overlaid contours of the features classified using the reject option 
(see caption of Fig.fTTTi. The corresponding original image without contours is presented in Fig.|2] 
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Fig. 13. G-band image at (/i) = 0.9, with the overlaid contours of the features classified using the reject option (see caption of Fig. 
fTTT l. The original field of view has been slightly cropped horizontally for this display. The direction of the closest limb is upwards. 
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;. 14. Original G-band image at (ju) = 0.9 without overlaid contours, for comparison with Fig.fTSl 



